Enterprise Database Systems
Analyzing Data Using Python
Analyzing Data Using Python: Analyzing Data Using Python: Filtering Data in Pandas
Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas
Analyzing Data Using Python: Data Analytics Using Pandas
Analyzing Data Using Python: Importing, Exporting, & Analyzing Data With Pandas

Analyzing Data Using Python: Analyzing Data Using Python: Filtering Data in Pandas

Course Number:
it_daavlpdj_03_enus
Lesson Objectives

Analyzing Data Using Python: Analyzing Data Using Python: Filtering Data in Pandas

  • discover the key concepts covered in this course
  • look up data using different techniques
  • apply the loc and iloc functions to access specific rows and columns
  • filter data using the loc, iloc, at, and iat functions
  • filter data using the loc, iloc, at, and iat functions
  • perform conditional filtering using the query function
  • parse and manipulate datetime values
  • select and drop specific columns
  • apply regular expressions and other advanced techniques to select and drop columns
  • summarize the key concepts covered in this course

Overview/Description

Not all data is useful. Luckily, there are some powerful filtering operations available in pandas. The course begins with a detailed look at how loc and iloc can be used to access specific data from a DataFrame. You'll move on to filter data using the classic pandas lookup syntax and the pandas filter and query methods. You'll illustrate how the filter function accepts wildcards as well as regular expressions and use various methods such as the .isin method to filter data.

Furthermore, you'll filter data using either two pairs of square brackets - in which case the resulting subset is itself a DataFrame - or a single pair of square brackets, in which case the returned data takes the form of a Series. You'll drop rows and columns from a pandas DataFrame and see how rows can be filtered out of a DataFrame. Lastly, you'll identify a possible gotcha that arises when you drop rows in-place but neglect to reset the index labels in your object.



Target

Prerequisites: none

Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas

Course Number:
it_daavlpdj_04_enus
Lesson Objectives

Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas

  • discover the key concepts covered in this course
  • identify and deal with duplicate records
  • summarize records into bins or categories
  • compute aggregations on data
  • perform common grouping and aggregation operations
  • use pivot tables to explore data
  • use pivot tables to summarize data
  • combine and merge records
  • perform inner join operations using the merge() method
  • perform left and right join operations using the merge() method
  • implement joins using the join() method
  • manipulate and analyze time series data
  • summarize the key concepts covered in this course

Overview/Description

For data analysis to be useful and accurate, the analyzed data needs to be cleaned and curated. There are copious methods to achieve this in pandas. In this course, you'll learn how to identify and eliminate duplicates in pandas.

You'll start by using the pandas cut method to discretize data into bins, using bins to plot histograms and identify outliers using box-and-whisker plots. You'll parse and work with datetime objects read in from strings and convert string columns to datetime using the dateutils python library.

Moving on, you'll master different pandas methods for aggregating data - including the groupby, pivot, and pivot_table methods. Lastly, you'll perform various joins - inner, left outer, right outer, and full outer - using both the merge and join methods.



Target

Prerequisites: none

Analyzing Data Using Python: Data Analytics Using Pandas

Course Number:
it_daavlpdj_01_enus
Lesson Objectives

Analyzing Data Using Python: Data Analytics Using Pandas

  • discover the key concepts covered in this course
  • create a pandas Series object
  • perform basic operations on Series objects
  • cast data types within Series objects
  • filter data in Series objects using logical operations
  • create the tabular DataFrame object in pandas
  • look up data using DataFrame methods
  • create DataFrames from dictionaries and tuples
  • summarize the key concepts covered in this course

Overview/Description

Built on the Python programming language, pandas provides a flexible and open source tool for data manipulation. In this course, you'll develop the skills you need to get started with this library. You'll begin by installing pandas from a Jupyter notebook using pip.

Next, you'll instantiate a pandas object, including a Series and DataFrame, and practice several ways of instantiating Dataframes - for instance, from lists, dictionaries of lists, and tuples created from lists using the zip method.

You round out this course by performing filter operations on DataFrames using the loc and iloc operations - fundamental techniques used to access specific rows and columns. You'll use loc to identify rows based on labels and iloc to access rows based on the index offset position starting from 0.



Target

Prerequisites: none

Analyzing Data Using Python: Importing, Exporting, & Analyzing Data With Pandas

Course Number:
it_daavlpdj_02_enus
Lesson Objectives

Analyzing Data Using Python: Importing, Exporting, & Analyzing Data With Pandas

  • discover the key concepts covered in this course
  • import and export data in CSV files
  • import and export data using HTML and JSON files
  • serialize data to Excel and Pickle files
  • perform basic data manipulation operations on DataFrames
  • explore and edit data in DataFrames
  • sort records based on column values
  • compute basic statistics on data stored in DataFrames
  • compute statistical summaries on data stored in DataFrames
  • summarize the key concepts covered in this course

Overview/Description

You can analyze a myriad of data formats through pandas - all you need to know is how. In this course, you'll bring various data types into pandas and perform several operations on the data.

You'll practice using common file types such as CSV, Excel, JSON, and HTML through pandas. You'll not only learn how to open and read files of different types, but you'll also serialize objects and copy them to the in-memory clipboard.

You'll move on to perform various fundamental operations on DataFrame objects. Lastly, you'll learn to compute basic statistics, access metadata, and modify and sort data in rows.



Target

Prerequisites: none

Close Chat Live